FEAT: Jailbreak Scenario Expansion by ValbuenaVC · Pull Request #1340 · Azure/PyRIT

ValbuenaVC · 2026-01-30T20:16:54Z

Description

Adding more features to the Jailbreak scenario! Major changes:

JailbreakStrategy now supports multiple different attack types via ManyShot, PromptSending, Crescendo, and RedTeaming values.
New attack strategies can be collected using SINGLE_TURN and MULTI_TURN aggregates; PYRIT has been deprecated.
The initializer now accepts k_jailbreaks, num_tries, and jailbreak_names; these allow you to choose a random number of jailbreaks, how many times to try each jailbreak, and to choose which jailbreaks specifically you'd like to use respectively. Note that k_jailbreaks and jailbreak_names are mutually exclusive.
A default adversarial target has been added to support the relevant attack strategies.

Tests and Documentation

Expanded to support new strategies.

pyrit/datasets/jailbreak/text_jailbreak.py

pyrit/scenario/scenarios/airt/jailbreak.py

nina-msft · 2026-02-18T21:01:49Z

pyrit/scenario/scenarios/airt/jailbreak.py

            DatasetConfiguration: Configuration with airt_harms dataset.
        """
-        return DatasetConfiguration(dataset_names=["airt_harms"], max_dataset_size=4)
+        return DatasetConfiguration(dataset_names=["airt_harms"])


Any reason for removing max dataset size? I think we have this set so that our integration tests don't run the entire dataset by default, which would slow it down.

Is there anywhere where the user provides a max prompt number that we could pass through to here if its set, and otherwise if not set we keep at default of 4?

I tried making it a user-provided parameter, but the method implicitly belongs to the Scenario superclass since it's called in initialize_async and only exposed as self._dataset_config per instance. I think this is a good feature for a future scenario refactor but is out of scope here, so I put it back to 4 for simplicity's sake and for the integration tests.

pyrit/scenario/scenarios/airt/jailbreak.py

nina-msft · 2026-02-18T21:14:38Z

pyrit/scenario/scenarios/airt/jailbreak.py

+        all_templates = TextJailBreak.get_jailbreak_templates()
+
+        if jailbreak_names:
+            diff = set(jailbreak_names) - set(all_templates)


curiosity: whoa my brain doesn't compute this logic lol

is the diff = the names that are in jailbreak_names and not in all_templates

could we make the same comparison by checking for name in jailbreak_names if name not in set(all_templates) raise error and this is just a more efficient way of doing that?

You computed it correctly 🙂 but giving it a second look it was really not readable, so I added a comment that explains how it works. The comparison is the same as the one you described but more efficient

pyrit/scenario/scenarios/airt/jailbreak.py

Victor Valbuena and others added 27 commits January 26, 2026 20:06

Scaffolding

022f70a

Precommit

e85cdb9

fixtures and basic tests

fc260c3

basic tests

89a8079

basic tests

b18f224

last test

96ddf6c

jailbreak format test

eb4e936

sample jailbreak prompt

243ea0a

Merge branch 'main' into jailbreak

946fdde

real jailbreaks added

132caf5

Merge branch 'main' into jailbreak

c4e625f

Merge branch 'main' into jailbreak

79d1a64

changing dataset name

cb28fda

moved jailbreak discovery

f399b6d

changed path resolution

75436ea

minor changes

c0022f6

minor bug

9f579f2

Merge branch 'main' into jailbreak

ccf7025

old dataset name

349cc6b

precommit

9fa6430

random jailbreak selection

513cbf3

error handling

b57b35a

error handling docstring

999a0c6

Merge branch 'Azure:main' into jailbreak2

f3ec8bb

scaffolding

89fd8bd

scaffolding for subset

66650a6

scaffolding

fa5b01a

ValbuenaVC changed the title ~~Jailbreak Scenario Expansion~~ [DRAFT] Jailbreak Scenario Expansion Feb 5, 2026

Merge branch 'main' into jailbreak2

44bc05c

ValbuenaVC changed the title ~~[DRAFT] Jailbreak Scenario Expansion~~ [DRAFT] FEAT: Jailbreak Scenario Expansion Feb 5, 2026

ValbuenaVC and others added 13 commits February 12, 2026 11:13

Merge branch 'main' into jailbreak2

827ec0e

params

8168db8

tweaks

5ac7651

dataset_size

20ef0c3

k_jailbreak bug

06bb694

Merge branch 'main' into jailbreak2

03a1e9b

tests

6a67ac4

new strategies

4b441d4

adversarial chat

b14f564

roleplay path

07b6142

roleplay

36b6b95

Merge branch 'main' into jailbreak2

f39aecd

Merge branch 'main' into jailbreak2

a43eeaf